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Let be a 3D point in world coordinates, c be the location of the camera in that coordinate 
system, JR be a rotation matrix representing the orientation of the camera (see Figure Al), A 
local Cartesian camera coordinate system is specified by axes X-Y-Z, where the Z axis lies 
along the central viewing ray of the camera and X and Y are perpendicular to that ray. We 
represent points in the image using film coordinates (u , v) on a film plane displaced fi:om the 
optic center by a distance f, tfie focal Icngtfi. Note that film coordinates are related to image 
pixel coordinates through an affine transformation that is a function of linear camera intrinsic 
parameters, and also perhaps by an additional nonlinear transformation representing lens 
distortion. Worldng in film coordinates amounts to assuming a calibrated camera, for which 
intrinsic parameters and lens distortion are known and inv^ble. 

In homogeneous coordinates, we have the projection equation 

[uvf]'- R(W'-c) 

relating film coordinates to 3D world coordinates, where ^ denotes equality up to a scale factor 
and * denotes a vector or matrix transpose. 



Al : Camera rotation induces a 2D homography in film coordinates. 

Let [a\ vi f]' be film coordinates for the image taken with orientation jR/, and let (Ua V2 fy be 
film coordinates for the image taken with orientation R2, Hien 

(u, vif]' Riiw-c) 
and * 

[U2V2f]' - R2(W-^C) lf2i?/ [Ui V, f]' 

Therefore the change in camera orientation induces a 2D homography R2 Rx ' on points in the 
film plane, and the image can be transformed with no knowledge of scene structure. 



A2 : Change in zoom induces an isotroj^ic scaling of film coordinafes. 

Let [ui, vi, fi]' be film coordinates for (he image taken wth focal length fi, and let [U2, V2, f2]' 
be fihn coordinates for ttie image taken with focal length Then 

[u2V2f2r = (f2/fi)[u,vif,r 

and thus change in focal length induces an isotropic scaling of fibn coordinates by the 
magnification factor f2/f i . 



A3 : Translation Tz Is approximately an isotropic scaling of film coordinates 



Let [x,y^]' ^Riw-c) bea3D worid point p represented withia the camera coordinate 
system, and let [ui, vi, fj* = [y/z, x/z, f] beits fihn coordinates. If the camera is translated a 
small distance Tz along the camera's Z axis, then the new coordinates of jpr are [x,y;z-Tz]\ 
which map to fihn coordinates [ua, V2, f}' == [y/(z-Tz), x/(z-Tz), f), so that 

[U2Y23 = (z/(z-Tz)) Kvil ) 
and the point is transformed by an isotropic scaling of film coordinates. Although this is strictly 
trae only for this point, and other points at'sccne depth z, if the object we are looking at is 
"shallow" with respect to distance from the camera (variation in z coordinates is small with 
respect to magnitude of z), then this formula is approximately true for all odier points on the 
object as well. Therefore, transladon by ajsmall amount Tz towards or away from a distant 
object is approximated by an isotropic scaling of film ooordinafccs, 

O A4 : Translatioa Tx and Ty is approximately a 2D Image translation 

Let tx,y^]* = l?(K^-c)bea3D worid point p represented within the camera coordinate 
W system, and let [uu vi, = [y/z, x/z, f] be its film coordinates. If the camera is translated a 

M small distance Tx and Ty along the camera's X and Y axes, then the new coordinates of p are 

Ly v*ich map to fihn coorditfates (uo, v^, = [(y-Tyy/zX (x-Txyx, f), so diat 

3 [«2V2l = (ui-Ty/z Yi-Tx/z] 1 

Q and the point is transfonned by a 2D translation of film coordinates. Although this is strictly 

true only for this pomt, and oftar points af scene depth z, if the object we are looking at is 
1^ "shallow'* with respect to distance fiom the camera (variation in z coordinates is small with 

hi respect to magnitude of z), then this formula is approxunately true for all other points on die 

object as well Therefore, translation by a ^mall amount Tx and Ty perpendicular to the central 
yiewmg axis of the camera (the Z axis) caii approximated for distant objects by a 2D translation 
in film coordinates. 



AS : O)mposition of transformations * 

The separate transformations for changing orientation and zoom (or translation Tz) can be 
combined as « 

[UiVafd' - ifi/fORjEi'lniVifiY 

I 

A6 z Local parameterization of changing^orientation and translation 

Given an image, we do not necessarily Iari)w it's absolute orientation ft/ with respect to the 
world, nor the absolute orientation Rj of die desired image. We thus write fee homography for 
changing camera orientation in terms of the current, local camera coordinate. Note that 2D 
homography R2R1' has the form of a 3D rotation matrix jR (up to a scale factor). We can 
th«ef(TO specify Ae homography R in local camera coordinates as a rotation by angle 8 around 
jaxis<i = [abcj\ ByRodriguesfomiula,wecanwritcjRas 
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Rzzl+ sia8 A + (1 - cosG) 
whcie Jr is the identity matrix and A is the skew-symmetric matrix 



0 -c b 
c 0 -a 
-b a 0 



Note, for suffideatly small rotation angleJ 6. we can use the smaU angle approjdmations 
sine «6 and cos9 «1 to vrate 
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Comparing to the small angle approximation formula for a rotation matrix in tenns of Euler 
angles roU, pitch and yaw about the camera x. y and z axes, respectively (see Hgure AI), we see 
that for small angles: yaw==9a, pifcch«0b and toll^Qc. 

Similar to the case of "changing camera orientadon, we do not necessarily know the focal length 
f I of a given image, nor the focal length ff we would like it to have after a simulated camera 
zoom. Instead, we parameterize a zoom transfonnation in terms of a desired magnification 
factor m f2 / f I . 

In the case of translation by Tz along the central viewing ray, we do not know tiie distance z to 
object points, or the distance Tz of the translation, but we likewise can spcdfy the isotropic 
scaling induced by fiic translation as a magnification factor m. Furthermore, translation by Tx 
and Ty perpendicular to the viewing ray can be specified locally as 2D translation paranaeters du 
and dv, which are functions of the unknown object distance z and magnitude of translation 
con^nents Tx and Ty, 

A7 : Speda^g yaw and pitch correcfion by sdecfing one poiat corr^ondeuce. 

Here we show that correction of camera yaw and pitch* can be specified by selecting one point 
coxxespondence (Rgoie Point (u,v) is selected, and we wish to detOTnine the camera 
orientation correction that brings tfiis point to the crater of the image (u=0, v=0)- That is, we 
Ydsh to compute the transformation that rotates ray [u v f]' into alignment with ray (0 0 Q'. The 
unit vector axis of tlus rotation is 

[abc] = [u V f]' X[0 0 fl' = [v-uOr /sqrt(uW) 
and the clockwise angle of rotation is 
e = atan(sqrt(u W) / f) 
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from which we can write the 2D homography R using Rodrigues fonnula, above. By 
construction, the titiird component of the rotation axis is always zero, i.e, c=0, so that for small 
angles this is a conection in yaw and pitdi alone, with no roll angle conection. 

AS : E^cieat approximation for implemeatii^ yaw and pitdi correction 

A great implementation speedup for yaw and pitch coifcction is possible in the case that we 
have both small rotation angle and a large focal loigflu In this case, 6 ~ (sqrt(uVv^) / f) and 
we can use JR « I + 6 A. After some manipulation we find that the orientadon correction 
homography 

[U2V2f]' ^ jRCurvjfl' 

reduces to 

Uj ~ Ui~U 
Vz ~ Yi-V 

which is simply the image shift (translation) thai takes POI (u,v) to the center of the image, 

A9 : Specifying yaw, pitch, roll and Tz by selecting two point correspondences. 

Here we show that coirection of camera yaw, pitch, roll and Tz can be specified by selecting 
two point correspondences (Figure A3). As previously (case AT) we select one point as the 
POI, and determine the 2D homography R that brings (his point to the image center (u=:0,Y=0), 
In addition to the POI, we also choose a second point, called Vi to denote ^Vertical unit**, 
which wll be mapped CKie **unif vertically above (he image center. The definidon of what 
distance fi in pixels conesponds to I unit is specified by the user (c.g, 1 unit - ii =100 pixels). 

Assumed point VI is mapped to film coordinate (s,t) by homography R. The coordinate system 
must be rotated and isotropically scaled so that point (s,t) maps to point (0,jx) which is located 
vertically above die image center at distance pL It can easily be shown that this transfcHmadon 
can be written as 



tjii 


-SIX 








0 


sy. 


til 








0 


0 


0 


1 



Roll and Tz correcdon 5 can be composed with pitch and yaw correction R to yield a single 2D 
homography = 5 jR that performs pitch, yaw, loll and Tz conecdon. 



AlO : Efficient approxixnaUoa for changing yaw, pitch, rail and Tz 

When implementing for small angles and large focal lengths, the 2D homography H above can 
be simplified to a translation, rotation and isotropic scale, i,e, a single similarity transfonnation, 
lca<^ng to a great computational sa^dngs. The derivation is similar to the case A8, above. 

All : Specifying yaw, pitch, roll , Tx, Ty and Tz by selecting three point correspondences. 

Here we show that full correction of camera yaw, pitch, roll, Xx, Ty, and Tz can be specified by 
selecting tfaiee point cortespondences (Figure A4), As previously (case A9), we select two 
points to be the POI and VI, and compute the homography that corresponds to correcting yaw, 
pitdi, roll and Tz by bringing tte POI to the image crater (u=0,v=0) and VI to point (0,p) one 
vertical unit above the center. Now select a third point CO = ^,q) in this corrected image. We 
then want to apply the 2D translation tiiat maps the origin (0,0) to point CX)=(p,q), namely 

u2=:u + p 

v2 = v + q 

is the correction for Tx and Ty. Let this transforaiation be written as matrix T. Then the Tx and 
Ty correction can be composed with the previous pitch, yaw, roll and Tz collections (R and S) 
to form a single 2D homography H^TSR that performs yaw, pitch, roll, Tx, Ty, and Tz, 

A12 : Efficient approximation for changing yaw, pitch, roll, Tx, Ty and Tz 

When implementing for small angles and large focal lengths, the 2D homography H above can 
be simplified to a translation, rotation and isotrofnc scale, Le. a single similarity transformation, 
leading to a great computational savings. The derivation is similar to cases A8 and AlO, above. 
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Figure A3: Spedj^Iag yaw, pitch, roll and Tz by sdecdng two point corrcspoadeaces. 






Figure A4: Specifying yaw, pitch, roll, Tx, Ty and Tz by sdecting three points. 
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